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FIELD OF THE INVENTION 

The invention relates to a method of distributing music from a service provider 
to a listener. The invention also relates to an apparatus for distributing music from a service 
provider to a listener. The invention also relates to a method of playing back a distributed 
audio signal, and finally an apparatus for playing back a distributed audio signal. 



BACKGROUND OF THE INVENTION 

Recently, the distribution of digital music over the Internet has gained in 
popularity and visibility. This is due, in part, to the widespread use of the MP3 music 
compression format, which permits relatively high quality reproduction of music using music 
files that are small enough to be downloaded by many home users. A typical three-minute 
song may be compressed into a file having a size of approximately 2.5 to 3 megabytes using 
MP3, while retaining sound quality comparable to the quality of music played from a 
compact disc. A home user connected to the Internet using a low cost modem capable of 
transferring 56 kilobits per second can download the song in a couple of minutes. 

Numerous applications for playing digital music have been developed for use 
on personal computers. Additionally, portable digital music playback devices have been 
developed, which can download MP3 encoded music from a personal computer, and play the 
downloaded music. These portable music players have numerous advantages over previously 
known portable music players, such as portable cassette players or portable CD players. For 
example, since portable digital music players would typically store music in solid state 
memory, there are no mechanical parts that may causaskipping or other audible glitches in 
the music when the device is used during jarring physical activities. Additionally, portable 
digital music players can be made very small, and are not confined to a particular form factor 
by the media that they play. 

These advantages have made portable digital music players very popular, with 
several companies manufacturing a variety of models that play MP3 encoded music. 

MP3 players are also being built for use in automobiles, and as part of a home 
audio system. These digital music players often use a hard drive or a CDROM to store music. 
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The primary advantage of such devices is their ability to store a large amount of music in a 
small space. For example, at approximately 1 megabyte per minute of CD-quality music 
stored in MP3 format, a single CD-ROM can store more than 10 hours of high quality music. 
Hard drives conforming to the size of a 3.5" drive bay on a personal computer, and having 

5 capacities exceeding 20 gigabytes, are now available inexpensively. Such a drive could store 
more than 300 hours of high quality MP3 encoded music. 

The ability to distribute digital music over a network, such as the Internet, 
offers several potential advantages to consumers and artists. 

Gensu moro oon oho es o specific - song s that they wish t o purchase, pay for th o se - 

4-0 songSj-and-d&vvnload-therBrOver-the-Iaternet. T he r e is-no-need to tr a ve l to a mu s ic , s tore, a n d — 

the music can be delivered through a modem or other communication device within a few 
minutes. Since there is no manufacturing or packaging cost and only minimal distribution 
cost involved in providing digital music, the prices charged for digital music distributed over 
the Internet could be lower than what is charged for music distributed by other means, while 

1 5 still maintaining a relatively high profit margin. Once downloaded, digital music, such as 
MP3 formatted music, provides the consumer with a high degree of flexibility. The music 
may be stored in a searchable database, and played on any of a number of devices. 

MP3 files provide no technological means of preventing perfect copies from 
being distributed illegally over the Internet. There is nothing in 1he MP3 format that prevents 

20 unauthorized copies from being played, that identifies the copyright holders or the licensee or 
that in any way ensures that the copyright holders will be compensated for use of the music. 

As a result of this lack of copyright management, the recording industry has 
actively opposed the wide scale adoption of MP3 as a format for distributing music over the 
Internet. Instead, the industry has partnered with technology companies to form the Secure 

25 Digital Music Initiative (SDMI), to develop a digital music format with acceptable copyright 
management. So far, there has been considerable disagreement among SDMI members, and 
although a proposed standard has been announced, there are not yet any digital music players 
that implement the SDMI format. Due to the installed base of MP3 users and the growing 
popularity of MP3, it is unclear whether the recording industry's efforts to impose its own 

30 music formats will be successful. 

The recording industry is also concerned about issues of retaining control over 
music content, and over marketing issues. 

The participation of the recording industry in the distribution of digital music 
may be vital to the future of both the recording industry and the nascent digital music 
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industry. Since the major record labels hold the rights to almost all popular music, and have 
contracts with almost all popular artists, and since it is important for them to hold on to these, 
rights, it is unlikely that much popular music by major artists will be distributed in 
compressed digital format unless the concerns of the recording industry are addressed. At the 
5 same time, consumers may be unwilling to pay for digital music players that place severe 
restrictions on the availability of digital music, and for digital music that offers no price, 
quality, or selection advantages over what is available on other media, such as compact disks. 

The invention also relates to other types of digital signals comprising audio, 
e.g. the wma-format or other signals comprising audio such as video (e.g. mpeg-format, avi- 
1 0 format) and computer games. 

OBJECT AND SUMMARY OF THE INVENTION 

It is an object of the invention to provide advantageous audio distribution 
whereby the risk of unauthorized copying of distributed signals comprising audio data, such 

1 5 as music and video, is reduced. 

This is obtained according to the present invention by a method for a service 
provider to distribute an audio signal to a listener, where, before the listener gets access to the 
audio signal, the audio impression of the audio signal is personalized to said listener, said 
personalizing being performed by filtering said audio signal using a set of head related 

20 transfer functions comprising listener parameters being specific for said listener. 

Thereby, even if the audio data could be copied technically, the resulting 
sound experience is never optimal for anyone but the original listener. Further, after 
downloading the audio data, the audio has been provided with listener specific information 
being a kind of watermark, whereby it is possible to identify the listener by comparing 

25 identified parameters in audio data with parameters being stored in a database. A further 

advantage is that the service provider does not need to expose the original audio data to other 
parties, such as especially copyists. A further use of the invention is that HRTF jBltered audio 
can be sold at a low price and then later the listener might want to buy the original audio. 

In an embodiment, the personalization is performed before distributing said 

30 audio signal to said listener. Thereby, the service provider is assured that only personalized 
audio is distributed, minimizing the risk of unauthorized persons getting access to the original 
audio signal. 

In a specific embodiment, the listener parameters in the set of head related 
transfer functions have been chosen between different sets of listener parameters being 
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specific for said listener. Thereby, the listener can select an audio impression of specific 
interest. This could e.g. be a specific seat in the Carnegie Hall, at the front row of Wembley 
stadium, etc. 

In another embodiment, the set of head related transfer functions has been 
5 modified in a substantially inaudible way, where said modification is performed by 

embedding information into the set of head related transfer functions before being used for 
filtering the audio signal. Thereby information can be added providing information that e.g. is 
to be interpreted by the playback device. The information could e.g. be an expiry date of the 

audio signal, after which the audio signal may no longer be played back. 

-t0 T he in v en tion also r elate s to a -jt nelhod ofpla y ing back rardisteilreted^ 

signal, wherein the audio impression of the audio signal has been changed according to first 
listener parameters being specific for a specific listener, comprising the steps of: 

- detecting said first listener parameters used for changing the audio impression of said 
audio signal, 

15 - comparing said detected first listener parameters with second listener parameters, 

- playing back said audio signal if said detected first listener parameters identify a listener 
being identical to the listener identified by said second listener parameters. 

Thereby the playback device can use the modification as a watermark and 
determine whether the audio signal has been bought by the owner/user of the playback 
20 device. The modification can be specific listener parameters such as listening room acoustics, 
type of speaker, range of hearable frequencies, etc. 

In a specific embodiment the steps of: 
detecting said first listener parameters used for changing the audio impression of said 
audio signal and 

25 - comparing said detected first listener parameters with second listener parameters, 

are performed by comparing the audio signal that has been changed according to said first 
listener parameters with a corresponding audio signal having been changed according to said . 
second listener parameters. This is a simple way of detecting and comparing the audio. 

In an embodiment, the first and second listener parameters are parameters to 

30 be used in a set of head related transfer functions, and wherein the audio signal has been 
changed by filtering it using the set of head related transfer functions having listener 
parameters being specific for a specific listener. The HRTFs are very listener specific 
functions, which are unique for each listener, therefore a fingerprint is provided on the music, 
making it possible to clearly identify the right listener. 
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In a specific embodiment the steps of: 

- detecting said first listener parameters used for changing the audio impression of said 
audio signal and 

- comparing said detected first listener parameters with second listener parameters, 

5 are performed by comparing the frequency spectrum of the audio signal having been filtered 
by the set of head related transfer functions having said first listener parameters and the 
frequency spectrum of the set of head related transfer functions having said second listener 
parameters. This can be a very reliable way of detecting and comparing and it does not 
require access to the original audio signal. 
10 In a specific embodiment the method further comprises the step of: 

• detecting information having been embedded into the head related transfer function 
before filtering the audio signal and, if the detected first listener parameters identify a 
listener being identical to the listener identified by said second listener parameters, 
playing back the audio signal according to the detected information. 
15 Thereby, information that has previously been embedded by e.g. a service 

provider can be detected and an action can be taken according to the information. An action 
could be not playing back the audio because of the embedded information. 

The invention further relates to an apparatus for playing back a distributed 
audio signal, wherein the audio impression of the audio signal has been changed according to 
20 first listener parameters being specific for a specific listener, comprising: 

- means for detecting said first listener parameters used for changing the audio impression 
of said audio signal, 

- means for comparing said detected first listener parameters with second listener 
parameters stored locally, and 

25 - means for playing back said audio signal if said detected first listener parameters identify 
a listener being identical to the listener identified by said second listener parameters. 
WO 01/24576 discloses remote audio processing dependent on hearing 
profiles for persons with hearing problems and the use of psycho acoustical processing. In 
this document the processing is not for copy protecting audio, but only for optimizing the 
3 0 sound for listeners with hearing disabilities. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In the following preferred embodiments of the invention will be described 
referring to the figures, where 
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figure 1 illustrates the method according to the present invention, 
figure 2 illustrates an apparatus according to the present invention, 
figure 3 shows how the HRTF information in audio data can be used as a 

watermark for identifying the listener of the audio data and 
5 figure 4 illustrates a method of playing back distributed audio data using a 

playback device. 

DESCRIPTION OF PREFERRED EMBODIMENTS 

I n 1ho-p r c3e nt - m ve ntion - ^ functions (IIRTF3) is 

40 usedr4o-peraenaUze^di04€Ha^ 

describing how sound propagates from a specific sound source to the ear, and the number of 
HRTFs belonging to a set could be from one HRTF describing sound propagation from a 
source to the ears to a number of HRTFs depending on the number of sources delivering 
sound and the number of ears. 

15 In the following HRTF is defined. By finding the sound pressure produced by 

an arbitrary source at the eardrum (taking into consideration parameters such as the distance 
between the ears and the shape of the outer ear), all that is needed is the impulse response 
from the source to the eardrum, which e.g. can be measured by placing a microphone in the 
ear. This is called the Head-Related Impulse Response, and its Fourier transform is called the 

20 Head Related Transfer Function (HRTF). The HRTF captures all of the physical cues to 
source localization. Once you know the HRTF for the left ear and the right ear, you can 
synthesize accurate binaural signals from a monaural source. The head related transfer 
function is well known and is described in a number of documents, such as (Blauert, Spatial 
hearing: The Psychophysics of Human Sound Localization (MIT Press, Cambridge, MA, 

25 1 983). When audio is filtered by a set of HRTFs, the audio is optimized for the person to 
whom the set of HRTFs belongs, and therefore the sound experience is never optimal for 
anyone but the person to whom the set of HRTFs belongs, .The HRTFs are filter functions 
with parameters or coefficients being specific for specific persons. For a specific person 
different sets of HRTFs can be obtained depending on the arbitrary source mentioned above. 

30 The HRTFs depend among other things on the distance between the source and the person 
and also the characteristics of the room in which the function parameters are. When the 
source is e.g. a headphone, the HRTFs depend on the headphone through which sound 
reproduction takes place. The result of filtering audio using this function is that an optimal 
spatial reproduction of surround audio in headphones is obtained. The source could also be a 
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typical loudspeaker; in this case it is necessary to perform cross-talk cancellation, which can 
be based e.g. on the HRTF. 

Figure 1 illustrates the method according to the present invention; a listener 
101 delivers data 103 to be used for generating a set of head related transfer functions 
(HRTFs) to a service provider 105. The listener requests audio data 107 such as apiece of 
music or other multimedia data comprising audio such as a movie or a computer game. The 
service provider 105 then reads audio data 109 from a storage medium 111 and filters the 
data using the listener specific set of HRTFs. The filtered data 1 13 are then distributed to the 
listener 101 and the listener 103 can now play back the data and experience the optimized 
audio. 

The listener could obtain the listener data in e.g. music stores, at home, 
warehouses or another measuring location, where equipment for measuring the HRTF 
parameters is available. Further, the measuring could be performed using a network 
connection between a measuring location and the listener. When measuring the HRTF 
parameters, the listener can choose, which environment the audio experience should simulate, 
this could e.g. be a specific seat in the Carnegie Hall, at the front row of Wembley stadium, 
etc. In this case, the listener will not necessarily have to be present at that particular 
environment; the measuring location might already have general data about acoustics at these 
places, which then can be incorporated in the listener measuring process. A number of 
different sets of HRTF parameters can thereby be generated for each listener, and when the 
listener requests the audio data, he also makes a selection of which environment the audio 
experience should simulate. 

Figure 2 illustrates an apparatus according to the present invention. In a 
specific embodiment the service provider is an Internet service provider (ISP). The ISP offers 
multimedia data comprising audio to listeners by using a public communication network, 
such as the Internet. In order for the listener to be able to buy music, it is necessary that the 
ISP has at least one set of listener specific parameters to be used in a set of HRTFs. 
Parameters for simulating the environment are then used in combination with the listener 
specific parameters when generating the set of HRTFs, and a number of environment specific 
HRTF parameters can then be obtained for each listener. The listener transmits the obtained 
HRTF parameters 203 to the ISP using a transmitter. In this specific case the transmitter is an 
I/O adapter in a computer communicating with the ISP over ttie Internet. 

The ISP then receives the parameters using a receiver 207; the receiver could 
e.g. be an I/O adapter in a server communicating with the transmitter over the Internet. In an 
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embodiment, the parameters are stored in a database 209. This database could comprise 
parameters from a number of listeners, which are identifiable e.g. by a listener ID. Further, 
the database could also comprise different sets of parameters for one listener, the parameters 
being specific for different environments as mentioned above. Then, when the listener 
5 requests audio data e.g. using a web page belonging to the ISP, the listener enters an ID and 
identifies the requested audio data. The ISP then locates the HRTF parameters of the listener 
according to the ID, whereafter the ISP locates the audio data in a database 211. The 
identified audio data are filtered 213 using the HRTF parameters of the listener. A transmitter 
215 tr ansmit? the filtered audio d ata the listener over the. Internet, which is received at the 

1 0 listener by th» receiver Again the receiver could e.g. be an I/O adapte r in a computer 

communicating with the transmitter over the Internet, and the transmitter an I/O adapter in a 
server communicating with the listener over the Internet. The receiver 217 stores the received 
data in a database 219. The audio data are now available to the listener for playback by a 
playback device 219 receiving data from the database 221 . The playback device could be any 

1 5 device, such as a portable DVD, CD, MP3 player etc. Because of the above, the processed 
audio has characteristics that are unique for one listener. Therefore, even if the result could 
be copied technically, the resulting sound experience is never optimal for anyone but the 
original listener. Further, after downloading the audio data, the audio has been provided with 
listener specific information being a kind of watermark, whereby it is possible to identify the 

20 listener by comparing identified parameters in audio data with parameters being stored in e.g. 
the database 209. The set of HRTFs could also be modified in an inaudible way, making it 
possible to embed information into the set of HRTFs. The information could e.g. be a date 
until which the listener is allowed to playback the audio data. 

By performing the filtering of the audio data using a set of HRTFs before the 

25 data is downloaded over the Internet to the listener, two or more sound channels are 

•mapped' to exactly two signals representing the left and right speaker. Thereby there is only 
need for two channels when downloading. 

The invention is not limited to distribution over the Internet. The service 
provider could also send a CD to the listener after the audio content of the CD has been 

30 personalized using HRTF. Further, the listener might be the service provider himself, and 
when he makes copies of an original CD, then the copies are filtered using listener specific 
HRTFs. Thereby the audio is optimized for the listener, and further the audio is personalized 
making it less interesting for other users. The HRTF functionality could be incorporated in a 
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CD recorder device in such a way that it is only possible to record audio after the audio has 
been filtered by a listener specific HRTF. 

In figure 3 it is shown how the HRTF information in audio data can be used as 
a watermark for identifying the listener of the audio data. First of all, the original content is 
5 identified 301 using hashing, such as fingerprinting. The HRTF parameters can then be 
extracted from the audio data by comparing the audio data with the original audio data. 
Another method of identifying 301 the HRTF parameters could also be the searching for 
features in the frequency spectrum of the audio data and comparing these with notches in a 
different set of HRTFs. After the HRTF parameters have been extracted, the listener can be 

10 identified by comparing 303 the extracted listener specific HRTF parameters with data in a 
database 305 linking HRTF parameters with listener ID's. This database could in one 
embodiment be the database 209 at the ISP mentioned in figure 2. The whole identification 
process could be performed by a microprocessor executing a program designed for 
performing the above step. 

15 In figure 4 a method of playing back distributed audio data by a playback 

device is illustrated. Before playing back the audio, the set of HRTFs that has been used for 
filtering the audio is extracted 401 . This can be done as mentioned in the description of figure 
3. The detected HRTF parameters are then compared 403 to a listener specific HRTF 405 
(UJHRTF) stored on the device, and linking the listener to the device. If the extracted set of 

20 HRTFs equals the listener specific set of HRTFs, the audio has not been copied illegally, and 
the audio data can be played back 407 by the playback device. Otherwise, the audio data is 
deemed to be illegal, and the playback process is ended 409. The playback device could be 
any device connected to headphones or loudspeakers, such as a portable DVD, CD, MP3 
player etc, where the device comprises processing means, such as a microprocessor for 

25 performing the HRTF detection. 

In a specific embodiment, the playback device further comprises means for 
detection and interpretation of the. additional information that has been embedded in the set of 
HRTFs. One application of this could for example be to detect the date until which the 
listener is allowed to play back the audio signal, and if the date has expired, then the 

30 playback device will not playback the audio signal. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claim. The 
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word 'comprising' does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodied by one and the same 
5 item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 

In summary, the present invention relates to a method for a service provider to 
dist ribute an audio signal a listenex-Befhre delivering the, midio sipna.1 to the listener, the 

4CU^udio-impr^ssiQn-ofjhe audio signal is personalized to the listener. The personalizing_£Quld^ 
e.g. be performed by filtering the audio signal using a set of head related transfer functions 
(HRTFs) defined by parameters being specific for the listener. This is for minimizing the 
interest of performing illegal copying of the audio signal. The invention further relates to a 
method of playing back a distributed audio signal and a playback device for playing back a 

15 distributed audio signal, where the personalization is detected and used as a watermark, 
which is identified and interpreted before playing back the distributed audio. 
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1 . A method for a service provider to distribute an audio signal to a listener, 
where, before the listener gets access to the audio signal, the audio impression of the audio 
signal is personalized to said listener, said personalizing being performed by filtering said 
audio signal using a set of head related transfer functions comprising listener parameters 

5 being specific for said listener. 

2. A method according to claim 1, wherein the personalization is performed 
before distributing said audio signal to said listener. 

10 3. A method according to claim 1-2, wherein the listener parameters in the set of 

head related transfer functions have been chosen between a number of sets of listener 
parameters being specific for said listener. 

4. A method according to any of the claims 1-3, wherein the head related transfer 
1 5 functions have been modified in an substantially inaudible way, where said modification is 

performed by embedding information into the set of head related transfer functions before 
filtering the audio signal. 

5. A method of playing back a distributed audio signal, wherein the audio 

20 impression of the audio signal has been changed according to first listener parameters being 
specific for a specific listener, comprising the steps of: 

detecting said first listener parameters used for changing the audio impression of said 
audio signal, 

- comparing said detected first listener parameters with second listener parameters and 
25 - playing back said audio signal if said detected first listener parameters identify a listener 
being identical to the listener identified by said second listener parameters. 



6. 



A method according to claim 5, wherein the steps of: 
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- detecting said first listener parameters used for changing the audio impression of said 
audio signal and 

- comparing said detected first listener parameters with second listener parameters, 

are performed by comparing the audio signal that has been changed according to said first 
5 listener parameters with a corresponding audio signal having been changed according to said 
second listener parameters. 

7. A method according to claim 5 or 6, wherein first and second listener 

paramctcrs-arc-pajamctcrs-to-bo-used in a sot of - hcad - relatcd - transfcr functions, and - wherei n 

40 the^udie^i^al4ias-been^ 

functions having listener parameters being specific for a specific listener. 

8. A method according to claim 7, wherein the steps of: 

- detecting said first listener parameters used for changing the audio impression of said 
15 audio signal and 

- comparing said detected first listener parameters with second listener parameters stored 
locally 

are performed by comparing the frequency spectrum of the audio signal having been filtered 
by the set of head related transfer functions having said first listener parameters, and the 
20 frequency spectrum of a set of head related transfer functions having said second listener 
parameters. 

9. A method according to claim 6-8, further comprising the step of: ' 

- detecting information having been embedded into the head related transfer function 
25 before filtering the audio signal and, if the detected first listener parameters identify a 

listener being identical to the listener identified by said second listener parameters, 
playing back the audio signal according to the detected information. 

10. An apparatus for playing back a distributed audio signal, wherein the audio 
30 impression of the audio signal has been changed according to first listener parameters being 

specific for a specific listener, comprising: 

- means for detecting said first listener parameters used for changing the audio impression 
of said audio signal, 



13 09.04.2002 
means for comparingsaid detected first listener parameters with second listener 
parameters and 

means for playing back said audio signal if said detected first listener parameters identify 
a listener being identical to the listener identified by said second listener parameters- 
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The present invention relates to a method for a service provider to distribute an 
audio signal to a listener. Before delivering the audio signal to the listener, the audio 
impression of the audio signal is personalized to the listener. The personalizing could e.g. be 
performed by filtering the audio signal using a set of head related transfer functions (HRTFs) 
5 defined by parameters being specific for the listener. This is for minimizing the interest of 
performing illegal copying of the audio signal. The invention further relates to a method of 
playing back a distributed audio signal and a playback device for playing back a distributed 
audio signal, where the personalization is detected and used as a watermark, which is 
identified and interpreted before playing back the distributed audio. 

10 
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